High performance logistic regression for privacy-preserving genome analysis
نویسندگان
چکیده
Abstract Background In biomedical applications, valuable data is often split between owners who cannot openly share the because of privacy regulations and concerns. Training machine learning models on joint without violating a major technology challenge that can be addressed by combining techniques from cryptography. When collaboratively training with cryptographic technique named secure multi-party computation, price paid for keeping private an increase in computational cost runtime. A careful choice techniques, algorithmic implementation optimizations are necessity to enable practical over distributed sets. Such tailored kind Machine Learning problem at hand. Methods Our setup involves two-party computation protocols, along trusted initializer distributes correlated randomness two computing parties. We use gradient descent based algorithm logistic regression like model clipped ReLu activation function, we break down into corresponding protocols. main contributions new protocol function requires neither comparison protocols nor Yao’s garbled circuits, series engineering improve performance. Results For our largest gene expression set, train 7 billion multiplications; completes about 26.90 s local area network. The this work further optimized version which won first place Track 4 iDASH 2019 genome analysis competition. Conclusions paper, present its implementation, subprotocol securely compute function. To best knowledge, fastest existing high dimensional across
منابع مشابه
Privacy-Preserving Logistic Regression
Logistic regression is an important statistical analysis methods widely used in research fields, including health, business and government. On the other hand preserving data privacy is a crucial aspect in every information system. Many privacy-preserving protocols have been proposed for different statistical techniques, with various data distributions, owners and users. In this paper, we propos...
متن کاملPrivacy-preserving logistic regression
This paper addresses the important tradeoff between privacy and learnability, when designing algorithms for learning from private databases. We focus on privacy-preserving logistic regression. First we apply an idea of Dwork et al. [6] to design a privacy-preserving logistic regression algorithm. This involves bounding the sensitivity of regularized logistic regression, and perturbing the learn...
متن کاملPrivacy Preserving Regression Residual Analysis
Regression analysis is one of the most basic statistical tools for generating predictive models that describe the relationship between variables. Once a model has been generated, numerous goodness-of-fit measures are used to evaluate the degree to which the model characterizes the relationship between the variables under consideration. The analysis of regression residuals is one such measure, w...
متن کاملPrivLogit: Efficient Privacy-preserving Logistic Regression by Tailoring Numerical Optimizers
Safeguarding privacy in machine learning is highly desirable, especially in collaborative studies across many organizations. Privacy-preserving distributed machine learning (based on cryptography) is popular to solve the problem. However, existing cryptographic protocols still incur excess computational overhead. Here, we make a novel observation that this is partially due to naive adoption of ...
متن کاملPrivacy-Preserving Regression Algorithms
Regression is arguably the most applied data analysis method. Today there are many scenarios where data for attributes that correspond to predictor variables and the response variable itself are distributed among several parties that do not trust each other. Privacy-preserving data mining has grown rapidly studying the scenarios where data is vertically partitioned. While algorithms have been d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: BMC Medical Genomics
سال: 2021
ISSN: ['1755-8794']
DOI: https://doi.org/10.1186/s12920-020-00869-9